# Efficient video tagging
VideoChat-Flash Qwen2.5-7B InternVideo2-1B
Apache-2.0
A multimodal video-text model built upon InternVideo2-1B and Qwen2.5-7B, using only 16 tokens per frame and supporting input sequences of up to 10,000 frames.
Text-to-Video
Transformers
English
OpenGVLab
193
4
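The description's two figures (16 tokens per frame, up to 10,000 frames) imply a visual token budget that is easy to work out. The sketch below is only that arithmetic; it does not load or call the model. The function name `visual_token_count` and the 1 frame-per-second sampling rate in the usage lines are illustrative assumptions, not part of the model card.

```python
# Figures taken from the model description above.
TOKENS_PER_FRAME = 16
MAX_FRAMES = 10_000


def visual_token_count(num_frames: int, tokens_per_frame: int = TOKENS_PER_FRAME) -> int:
    """Return the number of visual tokens for a clip of `num_frames` frames.

    Hypothetical helper for illustration: it multiplies frames by tokens per
    frame and rejects inputs outside the range stated in the description.
    """
    if num_frames < 0:
        raise ValueError("num_frames must be non-negative")
    if num_frames > MAX_FRAMES:
        raise ValueError(f"description states support for at most {MAX_FRAMES} frames")
    return num_frames * tokens_per_frame


# Upper bound implied by the description: 10,000 frames * 16 tokens.
max_tokens = visual_token_count(MAX_FRAMES)

# Assumed example: a one-hour video sampled at 1 frame per second (3,600 frames).
one_hour_tokens = visual_token_count(60 * 60)

print(max_tokens)       # 160000
print(one_hour_tokens)  # 57600
```

Under these assumptions, the longest supported input corresponds to 160,000 visual tokens, before any text tokens from the prompt are counted.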
© 2025
AIbase